APPARATUS AND METHOD FOR STREAMING MULTIMEDIA DATA 

BACKGROUND OF THE INVENTION 

5 

This application claims the priority of Korean Patent Application No. 
2002-56487, filed on September 17, 2002, which is incorporated herein in its 
entirety by reference. 

10 1 . Field of the Invention 

The present invention relates to transmission of multimedia data, and 
more particularly, to a multimedia streaming apparatus and method by which 
multimedia data is adaptively transmitted according to the conditions of a 
network. 

15 2. Description of the Related Art 

Streaming is a technology by which data to be transmitted can be 
processed so that data transmission takes place endlessly and continuously. 
The streaming technology has been increasingly important in line with the 
growth of the Internet. This is because most users do not have Internet 

20 connection lines fast enough to rapidly download voluminous multimedia files. 
If the streaming technology is used, data can be presented by a client browser 
or plug-in even before full transmission of a file is not completed. 

However, in the streaming environment the conditions under which a 
network works are not always constant. Generally, even though multimedia 

25 streaming service is provided corresponding to an initial bandwidth of a network, 
if the number of service receivers increases, the bandwidth decreases and even 
in a worse case, a congestion of the network occurs such that stable service 
cannot be guaranteed. Accordingly, streaming service which changes the 
transmission rate correspondingly to the changes in the network is needed. 

30 Adaptive streaming is a technology which appropriately adjusts the 

amount of transmission data to changes in the conditions under which a 
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network works. Implementation examples of adaptive streaming include the 
following. 

U.S.Pat. No. 6,014,694, "System for adaptive video/audio transport over 
a network," granted to Aharoni et al. in January 2000, discloses a multiple bit 

5 stream service in which expected bit rates are divided into a plurality of levels, 
and multimedia streams corresponding to respective bit rates are generated 
and stored together. According to this method, a stream for each frame is 
stored or a stream corresponding to each bit rate level is separately stored, and 
then each server selectively streams a corresponding stream. However, this 

10 method has a shortcoming in that the size of a storage stream for servicing one 
multimedia content is big. 

U.S.Pat. No. 6,091,777, "Continuously adaptive digital video 
compression system and method for a web streamer," granted to Guetz et al. in 
July 2000, discloses a method for encoding data by adjusting the compression 

15 rate of an image when data is transmitted, considering the bandwidth of a 
channel and the conditions of resources of a client. However, this method 
adjusts the compression rate of a next frame by comparing the current 
bandwidth with the compression rate of the previous frame for every frame such 
that the amount of computation is great and overhead of a server is large. 

20 U.S.Pat. No. 6,181,711, "System and method for transporting a 

compressed video and data bit stream over a communication channelm," 
granted to Zhang et al. in January 2001, discloses a method by which the bit 
rate of pre-compressed data is converted and the data is again encoded 
corresponding to the bandwidth of a network and then transmitted. The bit rate 

25 conversion includes a decoding process, a bit rate conversion process, and an 
encoding process. According to the method, whenever the bandwidth of the 
network varies, the bit rate conversion should be performed through the above 
processes, thereby causing an increase on a load of a server. In addition, 
unless real time encoding is satisfactorily accomplished, a stable service cannot 

30 be guaranteed. 

SUMMARY OF THE INVENTION 
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The present invention provides a multimedia streaming apparatus and 
method by which optimal multimedia streaming service is provided by 
adaptively changing a transmission rate according to the bandwidth change of a 
network without imposing a burden on a server. 
5 According to an aspect of the present invention, there is provided a 

multimedia streaming apparatus comprising a multimedia streaming server 
which streams multimedia data corresponding to a predetermined quality of 
sen/ice (QoS) level in response to the parsing result of metadata corresponding 
to multimedia data desired to be provided for service and network bandwidth 
10 information which is input form the outside; and a multimedia streaming client 
which measures the bandwidth of a network to which the server is connected, 
by using a time when multimedia data is received and information on the size of 
the multimedia data, and transmits the measured bandwidth information to the 
server. 

15 According to another aspect of the present invention, there is provided 

a multimedia streaming server comprising a data storage unit which stores 
multimedia data desired to be provided for service and metadata related to the 
multimedia data; a metadata parsing unit which parses the metadata and 
outputs the parsing result in the form of a descriptor; a message receiving unit 

20 which receives network bandwidth information from the client; a quality of 
service (QoS) processing unit which selects a QoS level available for service in 
response to the descriptor information and the network bandwidth information, 
and extracts multimedia data corresponding to the selected QoS level, from the 
data storage unit; a buffer which stores the extracted data; a packet generation 

25 unit which packetizes the data stored in the buffer; and a packet transmission 
unit which transmits the data stored in the buffer to the client in each 
predetermined time interval. 

According to another aspect of the present invention, there is provided 
a multimedia streaming client comprising a packet receiving unit which receives 

30 the multimedia data from the server; a buffer which stores the received 
multimedia data; a multimedia decoder which reproduces the data stored in the 
buffer; a bandwidth measuring unit which measures a network bandwidth by 
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using the time when the multimedia data is received in the packet receiving unit 
and the size information of the data; and a message transmission unit which 
transmits the measured network bandwidth to the server so that the 
transmission rate of the multimedia data transmitted from the server is adjusted 
5 to the network bandwidth. 

According to another aspect of the present invention, there is provided 
a multimedia streaming method to be performed between a server and a client 
that are connected through a network, the method comprising (a) the client 
transmitting a service request message and session connection request 

10 message to the server; (b) transmitting a service confirmation message on the 
request message and a pair of dummy packets to the client; (c) in response to 
the pair of packets transmitted by the server, determining an initial bandwidth 
value of the network and transmitting the determined initial bandwidth value to 
the server; (d) comparing the initial bandwidth information transmitted by the 

15 client with descriptor information obtained as a result of parsing metadata, 
determining an appropriate QoS level, and starting to provide multimedia 
streaming service according to a transmission rate corresponding to the QoS 
level; (e) in response to packet information transmitted by the streaming service 
of the server, measuring the network bandwidth periodically and transmitting the 

20 measured bandwidth value to the server; and (f) according to the network 
bandwidth value transmitted by the client, extracting a predetermined 
multimedia stream, and transmitting the extracted multimedia stream to the 
client. 

According to another aspect of the present invention, there is provided 
25 a streaming method of a server which is connected to a client through a network, 
the method comprising (a) receiving the bandwidth of the network from the 
client; (b) based on a descriptor obtained as a result of parsing metadata 
corresponding to multimedia data desired to be provided for service, selecting a 
current time segment; (c) comparing a target bit rate defined in the descriptor for 
30 the selected segment with the network bandwidth, selecting a QoS level 
available for service; and (d) extracting frames corresponding to the selected 
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QoS level and transmitting the frames to the client at each predetermined time 
interval. 

According to another aspect of the present invention, there is provided 

a network bandwidth measuring method of a client which receives multimedia 

5 data from a server through a network, the method comprising (a) setting the 

size value of an accumulated packet to 0; (b) starting to receive a packet from 

the server; (c) setting the time when a first packet is received as T1 ; (d) after the 

first packet is input till a last packet is input, whenever a packet is input, 

accumulating the size value of the packet to the size of the accumulated packet; 

10 (e) if the last packet is input, setting the time when the last packet is input as T2; 

(f) measuring the network bandwidth by calculating 

Accumulated packet size x 1000x8 . , x r . 

; and (g) feeding the measured network 

TS2-TSI 

bandwidth information back to the server. 

15 BRIEF DESCRIPTION OF THE DRAWINGS 

The above objects and advantages of the present invention will become 
more apparent by describing in detail preferred embodiments thereof with 
reference to the attached drawings in which: 

FIG. 1 is a block diagram of a multimedia streaming server according to 
20 a preferred embodiment of the present invention, which provides adaptive 
multimedia streaming services in a variable network environment; 

FIG. 2 is a diagram of the structure of metadata according to a preferred 
embodiment of the present invention; 

FIG. 3 is a diagram of the structure of independent-type metadata 
25 according to a preferred embodiment of the present invention; 

FIG. 4 is a diagram of the structure of dependent-type metadata 
according to a preferred embodiment of the present invention; 

FIG. 5 is a diagram of the structure of metadata according to a preferred 
embodiment of the present invention for streaming a video stream and an audio 
30 stream at the same time by adjusting a transmission bit rate to the bandwidth 
change of a network; 



FIG. 6 is a block diagram of a multimedia streaming client according to 
a preferred embodiment of the present invention, which processes a multimedia 
packet transmitted by a server; 

FIG. 7 is a schematic diagram showing a multimedia streaming 
5 operation performed between the server of FIG. 1 and the client of FIG. 6; 

FIG. 8 is a flowchart showing a network bandwidth measuring method 
performed in the client shown in FIG. 6; 

FIG. 9 is a flowchart showing a multimedia streaming service process 
performed in the server shown in FIG. 1 ; 
10 FIG. 10 is a diagram showing an example of adaptively providing 

streaming service for multimedia data by adjusting a transmission bit rate to the 
bandwidth change of a network; 

FIG. 11 is a diagram showing the bit rate change of a multimedia stream 
by an adaptive streaming method according to an embodiment of the present 
15 invention when the level of quality of service (QoS) is set to 3; 

FIG. 12 is a diagram showing the change of a peak signal to noise ratio 
(PSNR) value when the QoS level is set to 3 as in FIG. 1 1 ; 

FIG. 13 is a diagram showing the bit rate change of a multimedia stream 
by an adaptive streaming method according to an embodiment of the present 
20 invention when the QoS level is set to 5; 

FIG. 14 is a diagram showing the change of a PSNR value when the 
QoS value is set to 5 as in FIG. 13; 

FIG. 15 is a diagram showing the bit rate change of a multimedia stream 
by an adaptive streaming method according to an embodiment of the present 
25 invention when the QoS level is set to 7; and 

FIG. 16 is a diagram showing the change of a PSNR value when the 
QoS value is set to 7 as in FIG. 1 5. 

DESCRIPTION OF THE PREFERRED EMBODIMENTS 
30 Referring to FIG. 1, the multimedia streaming server 100 according to 

the present invention comprises a data storage unit 101, a metadata parsing 
unit 110, a quality of sen/ice (hereinafter referred to as 'QoS') processing unit 
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130, a message receiving unit 160, a buffer 170, a packet generation unit 180, 
and a packet transmission unit. 

The data storage unit 101 stores compressed multimedia data intended 
to be provided for service, and metadata related to the multimedia data. Here, 
5 multimedia data is stored in the form of any one of audio data, moving picture 
data such as video data, still picture data, text data, and graphic data. The 
multimedia data is formed with a bit stream having any one of a spatial scalable 
function, a quality scalable function, a temporal scalable function, and a fine 
grain scalable (FGS) function. The metadata parsing unit 110 parses metadata 

10 and outputs the parsing result in the form of a descriptor. The QoS processing 
unit 130 performs QoS processing in response to the descriptor information of 
metadata and network bandwidth information. The message receiving unit 160 
receives information on conditions of a network 1 (that is, network bandwidth 
information) from a client. The buffer 170 is formed with two buffers, a packet 

15 storage buffer for storing a packet and a packet transmission buffer for 
transmitting a packet. The packet generation unit 180 packetizes data stored 
in the packet transmission buffer and the packet transmission unit 190 transmits 
data stored in the buffer 170 to the network 1 at a predetermined time interval. 

QoS processing by the QoS processing unit 130 will now be explained. 

20 First, a service level selection unit 140 receives a network bandwidth value from 
the message receiving unit 160, compares the bandwidth with a target bit rate at 
each QoS level defined in advance in the descriptor, and selects an available 
QoS level. Then, the QoS processing unit 130 extracts frames corresponding 
to the current QoS level and stores the frames in the buffer 170. 

25 Metadata 20 stores pointer information of frames corresponding to 

respective levels such that a file stored in the buffer 170 can directly access the 
metadata 20. The packet generation unit 180 packetizes data stored in the 
buffer 170 by cutting the data into packets of a predetermined size, and the 
packet transmission unit 190 transmits the data stored in the buffer to the 

30 network 1 at each predetermined time interval. When transmitting data, the 
packet transmission unit 190 transmits packets with identical intervals such that 
measuring the bandwidth in the client (300 of FIG. 6) can be performed 



accurately. A packet transmission interval and the size of a packet are 
adjusted to the average bit rate of data being provided for service. 

The metadata 20 is defined based on an extensible markup language 
(XML) and has scalability and compatibility according to the advantages of the 
5 XML. If a streaming service request is made between the server 100 and a 
client, the metadata 20 is parsed by the metadata parsing unit 110 and stored in 
the form of a descriptor so that the descriptor can be used internally in the 
server 100. 

FIG. 2 is a diagram of the structure of metadata according to a preferred 

10 embodiment of the present invention. In FIG. 2, boxes 21 through 26 indicate 
node objects and each line connecting nodes 21 through 26 indicates a 
hierarchical connection relation. The number next to each line connection part 
indicates cardinality that shows how many nodes relate to the node. Here, the 
number "1, 1" next to a line connection part indicates that the maximum number 

15 and the minimum number in the connection relation of the two nodes are 1 each, 
the number "0, *" indicates that the maximum number and the minimum number 
in the connection relation are 0 and infinity, respectively, and the number "1, *" 
indicates that the maximum number and the minimum number in the connection 
relation are 1 and infinity, respectively. 

20 For example, the relation between a STREAMING HINT node 21 and a 

HEADER GROUP HINT node 22, expressed as "1, 1", indicates that under the 
STREAMING HINT node 21 there should be only one HEADER GROUP HINT 
node 22 and should not be two or more nodes. The relation between the 
STREAMING HINT node and a SEGMENT GROUP HINT node 24, expressed 

25 as "1, *", indicates that under the STREAMING HINT node 21 there should be 
one or more SEGMENT GROUP HINT nodes 24 and there is no limit to the 
number of the SEGMENT GROUP HINT nodes 24. 

In FIG 2, the STREAMING HINT node 21 is a node at the top level and 
has attribute values specifying control types of the metadata 20 and a hierarchy 

30 type of the node. The control types specified in the STREAMING HINT node 
21 includes targetBitrateControl, targetQualityControl, targetComplexityControl, 
targetProfileControl, targetSpeedControl, targetDirectionContral and 
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targetDeviceControl. 

The targetBitrateControl is an attribute value for adjusting a 
transmission bit rate to the bandwidth change of a network, the 
targetQualityControl is an attribute value for adjusting a target quality of 
5 multimedia data to be provided, the targetComplexityControl is an attribute 
value for supporting differentiated services according to the conditions of 
resources of a client, the targetProfileControl is an attribute value for supporting 
differentiated services according to the compression format of multimedia data, 
the targetSpeedControl is an attribute value for adjusting a service speed 

10 according to a reproduction speed adjusting request from a client, the 
targetDirectionContral is an attribute for adjusting a service direction according 
to a reproduction direction adjusting request from a client, and 
targetDeviceControl is an attribute for supporting differentiated services 
according to the type of a client terminal. In addition, in the STREAMING HINT 

15 node 21 control types for QoS other than these control types can be additionally 
defined. According to the control attribute values of the STREAMING HINT 
node 21 the attribute of metadata is specified differently. 

The hierarch type of the STREAMING HINT node 21 is divided into an 
independent type and a dependent type, and according to the types, the 

20 structure of a lower level node varies. Particularly, a MEDIA SEGMENT HINT 
node 25 can build independent-type metadata or dependent-type metadata 
according to its structure. 

FIG. 3 is a diagram of the structure of independent-type metadata 
according to a preferred embodiment of the present invention. FIG., 4 is a 

25 diagram of the structure of dependent-type metadata according to a preferred 
embodiment of the present invention. 

Referring to FIG. 3, MEDIA SEGMENT HINT nodes 2511 through 2513 
have respective connection relations independent of each other. Meanwhile, 
FIG. 4 shows that a second MEDIA SEGMENT HINT node 2522 has a 

30 connection relation dependent on a first MEDIA SEGMENT HINT node 2521. 
Here, in the independent-type metadata structure each node has frame 
information of multimedia data corresponding to a service level without 
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reference to an upper node or reuse, while in the dependent-type metadata 
structure an upper node is referred to for information shared at each level and a 
lower node specifies only additional information. 

Again referring to FIG. 2, the STREAMING HINT node 21 is divided into 
5 the HEADER GROUP HINT node 22 having header information and the 
SEGMENT GROUP HINT node 24 which forms each segment information when 
multimedia data is divided into units of time, that is, into segments. 

The HEADER GROUP HINT node 22 has a number of FRAME 
HEADER HINT nodes 23 equal to the number of multimedia objects to be 
10 provided and each FRAME HEADER HINT node 23 has an attribute value 
indicating unique information of the node. 

Attribute values of the FRAME HEADER HINT node 23 includes 
streamed, streamType, scalability, frameRate, and avgBitrate. Among these 
values, streamed is a proper ID for identifying each multimedia data, and 
15 streamType is an attribute value indicating the type of multimedia and is divided 
into visual, audio, and other types. The scalability is an attribute value 
indicating the types of scalable functions and is divided into spatial, temporal, 
snr, and fgs (fine grain scalability) types. The spatial indicates a spatially 
scalable attribute value, the temporal indicates a temporally scalable attribute 
20 value, and the snr indicates an attribute value of quality scalability. The 
sourceLocator indicates location information of multimedia data stored in the 
data storage unit 101 of the server. The frameRate indicates the frame rate of 
multimedia data and the avgBitrate indicates the average bit rate of multimedia 
data. 

25 The SEGMENT GROUP HINT node 24 defines a divided multimedia 

stream in each time unit as a segment when an entire multimedia stream is 
divided into predetermined time intervals. The SEGMENT GROUP HINT node 
25 has a number of MEDIA SEGMENT HINT nodes 25 equal to the number of 
QoS levels for each segment. If the number of QoS levels increases, the 

30 number of MEDIA SEGMENT HINT nodes 25 increases such that the size of 
the metadata 20 increases but more detailed service can be provided. 

The MEDIA SEGMENT HINT node 25 has a level attribute indicating a 



10 



QoS level index, a numofFrames attribute value indicating the number of entire 
frames provided in the QoS level, and a targetBitrate attribute value indicating 
an average bit rate when a frame of the QoS level is provided for service. The 
MEDIA SEGMENT HINT node 25 includes at least one or more MEDIA FRAME 
5 HINT nodes 26 containing multimedia frame information to be actually 
transmitted, and has a number of MEDIA FRAME HINT nodes equal to the 
numofFrames attribute value. 

The MEDIA FRAME HINT node 26 has attribute values such as 
streamed, CTS, DTS, CodingType, frameOffset, frameLength, and frameNo. 

io The streamID is a proper ID for identifying each multimedia stream when a 
plurality of multimedia objects are provided for sen/ice and has the same value 
as the streamID of the FRAME HEADER HINT node 23. The DTS and the 
CTS indicate decoding time information and reproduction time information, 
respectively, of a frame. According to a frame reference method, frames are 

15 divided into I frames, P frames, and B frames when encoding and the 
CodingType indicates the type of a frame. The frameOffset indicates location 
information of multimedia data stored in the data storage unit 101 and the 
frameLength indicates the size of the frame. The frameNo indicates the 
number of the frame. 

20 These attribute values of the MEDIA FRAME HINT node 26 enables 

direct access to the multimedia data 10 stored in the data storage unit 101. If 
the data structure of this metadata is used, adaptive multimedia stream is 
available even when a plurality of multimedia streams should be provided at the 
same time for service. 

25 FIG. 5 is a diagram of the structure of metadata according to a preferred 

embodiment of the present invention for streaming a video stream and an audio 
stream at the same time by adjusting a transmission bit rate to the bandwidth 
change of a network. 

Referring to FIG. 5, each of MEDIA SEGMENT HINT nodes 2531 

30 through 2533 corresponding to respective QoS level of a unit segment has 
frame information of both video and audio streams, and distinguishes and 
accesses each multimedia data stream by using streamID attribute values of 
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MEDIA FRAME HINT nodes 2631 through 263n. For example, when streamID 
0 is defined as a video data stream and streamID 1 is defined as an audio data 
stream, any one of the multimedia data streams is accessed according to 
whether the streamID attribute value of the MEDIA FRAME HINT node 26 
5 indicates streamID 0 or streamID 1. Thus, when a plurality of streams are 
provided at the same time, if a multimedia frame to be provided for service is 
also defined in the metadata, a transmission rate can be adjusted to the 
bandwidth and synchronization among multimedia data can be successfully 
performed. 

io FIG. 6 is a block diagram of a multimedia streaming client 300 

according to a preferred embodiment of the present invention, which processes 
a multimedia packet transmitted by the server 100. Referring to FIG. 6, the 
multimedia streaming client 300 according to the present invention comprises a 
packet receiving unit 310, a buffer 320, a multimedia decoder 330, a bandwidth 

15 measuring unit 340, and a message transmission unit 350. 

The packet receiving unit 310 receives a multimedia stream from the 
server 100, and the buffer 320 stores the received multimedia stream data. 
The multimedia decoder 330 reproduces the data stored in the buffer 320, and 
the bandwidth measuring unit 340 measures a network bandwidth by using the 

20 time when the packet receiving unit 310 receives a multimedia packet and 
information on the size of a packet. 

When transmitting data, the server 100 transmits all packets in the 
buffer 170 in units of packet groups in a predetermined time interval and when 
transmitting a packet, transmits a packet number together. The packet 

25 receiving unit 310 distinguishes the first packet and the last packet in a packet 
group by using the packet numbers transmitted by the server 100. Assuming 
that a time when a first packet is received is t1 , a time when a last packet is 
received is t2, and the size of data of a packet group is Sp, the bandwidth of a 
network can be obtained by the following equation 1: 

30 

S x 8x1000 
E BW (Bandwidth) = (1) 
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Here, the unit of time is a millisecond (ms), the unit of the size of data is 
a byte, and the measuring unit of a bandwidth is bits per second (bps). A 
measured bandwidth is periodically fed back from the message transmission 
unit 350 to the server 100 whenever there is a change in the bandwidth. 
5 FIG. 7 is a schematic diagram for explaining a multimedia streaming 

operation performed between the server 100 shown in FIG. 1 and the client 300 
shown in FIG. 6. Referring to FIG. 7, in the adaptive multimedia streaming 
method according to the present invention, a data transmission rate between 
the server 100 and the client 300 is adjusted to the bandwidth of the network by 

10 using metadata when data is transmitted. For this, the client 300 first transmits 
a service request and session connection request to the server 100 (reference 
number (1)). The server 100 confirms the service request from the client 300 
(reference number (2)), and a service confirmation message and a pair of 
dummy packets to the client 300 (reference number (3)). 

15 The client measures an initial bandwidth (reference number (4)) in 

response to the pair of packets transmitted by the server 100. The sizes of the 
two packets transmitted by the server 100 are set to the same as the size of a 
packet unit, by which size multimedia data is divided into packets when the 
packet generation unit 180 of the server 100 packetizes the multimedia data. 

20 The transmission interval of the two packets is also set to the same as the 
transmission interval of the multimedia data. After receiving the dummy packet 
pair from the server 100, the client 300 determines an initial bandwidth value by 
using the equation 1 and transmits the value to the server 100 (reference 
number (5)). 

25 When the client 300 measures and transmits the initial bandwidth, the 

server 100 parses metadata, stores the parsed result in the form of a descriptor, 
compares the initial bandwidth with the descriptor information, determines an 
appropriate QoS level, and begins to provide multimedia streaming service 
(reference number (6)). Using the packet information transmitted by the server 

30 100, the client 300 periodically measures the bandwidth of the network 
(reference number (7)) and transmits the measured bandwidth value to the 
server 100 (reference number (8)). The server 100 extracts a predetermined 
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multimedia stream according to the bandwidth value transmitted by the client 
300 (reference number (9)) and transmits the extracted multimedia stream to 
the client 300 (reference number (10)). A data transmission process 
performed between the multimedia streaming server 100 and the client 300 at 
this time will now be explained. 

As described above, according to the purpose of a buffer, the buffer 170 
is broken down into the packet storage buffer for storing a packet and a packet 
transmission buffer for transmitting a packet. While the packet transmission 
unit 190 transmits packets stored in the packet transmission buffer, the QoS 
processing unit 130 stores frames corresponding to the QoS level in the packet 
storage buffer. At this time, the packet transmission is carried out in each 
predetermined time interval. If it is a time to transmit a next packet after a 
packet is transmitted and a predetermined time passes, the packet transmission 
unit 190 uses the previous packet storage buffer as a packet transmission buffer 
for transmitting a present packet and uses the previous packet transmission 
buffer that is empty after transmitting the previous packet, as a packet storage 
buffer for storing a packet. According to this recursive management of the 
buffer 170, continuous streaming can be performed while minimizing the 
changes in the conditions of the network. 

The packet transmission unit 190 transmits all the packets stored in the 
packet transmission buffer in each predetermined time interval and the packets 
transmitted at this time are defined as a packet group. Packets in the packet 
group unit have packet numbers which indicate the order of the packets in the 
transmission. Based on the packet numbers, the client 300 determines the 
order of packets and the beginning and end of the packet group, and measures 
the bandwidth in units of packet groups. The process for measuring a 
bandwidth in the client 300 will now be explained. 

FIG. 8 is a flowchart showing a network bandwidth measuring method 
performed in the packet receiving unit 310 of the client 300 shown in FIG. 6. 

Referring to FIG. 8, the packet receiving unit 310 initializes an 
accumulated packet size value which indicates the entire size of a received 
packet, by setting the value to 0 in step 3110 and receives a packet in step 3120. 
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If the packet receiving unit 310 receives a packet, a header and data are 
separated in the packet and from the header the packet number is obtained. 
Using the packet number, it is determined whether or not the received packet is 
the first packet in step 31 30. 
5 If the determination result of the step 3130 indicates that the received 

packet is the first packet, the receiving time of the first packet is set as TS1 in 
step 3140 and the step 3120 is performed again to continuously receive a 
packet. Then, it is determined whether or not the received packet is the first 
packet in step 3130, and if the determination result indicates that the received 

io packet is not a first packet, it is determined whether or not the packet is the last 
packet in step 3150. 

If the determination result of the step 3150 indicates that the received 
packet is not the last packet, the size value of the current packet is accumulated 
to the size value of the existing accumulated packet and the step 3120 is 

15 performed again to continuously receive a packet. Then, it is determined 
whether or not the received packet is the first packet in step 3130, and if the 
determination result indicates that the received packet is not a first packet, it is 
again determined whether or not the packet is the last packet in step 3150. 

If the determination result of the step 3150 indicates that the received 

20 packet is the last packet, the receiving time of the last packet is set as TS2 in 

step 3170. Then, by plugging in the receiving time of the first packet TS1, the 

receiving time of the last packet TS2, and the size value of the accumulated 

packet into the equation 1, that is, by calculating 

Accumulated packet size x 1000 x 8 r . . , . 
- , the bandwidth of the network is measured in 

TS2 - TS\ 

25 step 3180. 

FIG. 9 is a flowchart showing a multimedia streaming service process 
performed in the server 100 shown in FIG. 1. Referring to FIG. 9, first, the 
server 100 receives the bandwidth of the network from the client 300 in step 
1100, and selects a current time segment from the descriptor generated based 
30 on the metadata in step 1200. Then, by comparing the bandwidth with a target 
bit rate defined in the descriptor for the selected segment, the server 100 
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selects a QoS level available for service in step 1400. The number of QoS 
levels is determined when the metadata is defined and the target bit rate of 
each level is determined based on an average bit rate of multimedia data. 
Since the metadata contains information on frames corresponding to each level, 
5 if the QoS level is determined, only the frames corresponding to the selected 
level are extracted in step 1500 and stored in the buffer in step 1700. Then, 
through the packet transmission unit 190, data stored in the buffer 170 is 
transmitted to the client 300 in each predetermined time interval. 

FIG. 10 is a diagram showing an example of adaptively providing 

10 streaming service for multimedia data by adjusting a transmission bit rate to the 
bandwidth change of a network. 

Referring to FIG. 10, for example, if the bandwidth of the current 
network detected by the client 300 is 400 kbps, the server 100 selects an 
appropriate QoS level from the metadata. As shown in FIG. 10, when the 

is bandwidth of the network is 400 kbps, the QoS level is formed by 3 classes and 
at this time each of MEDIA SEGMENT HINT nodes 2541 through 2543 
indicates a QoS level. For example, assuming that targetBitrate value, i.e., 
one of the attributes of MEDIA SEGMENT HINT nodes 2541 through 2543, is 
level 1, the bit rate is set to 192 kbps, if the value is level 2, the bit rate is set to 

20 356 kbps, and if the value is level 3, the bit rate is set to 689 kbps, .level 2 that 
has a target bit rate value closest to the current bandwidth (that is, 400 kbps) is 
selected as the QoS level. 

The MEDIA SEGMENT HINT nodes 2541 through 2543 have MEDIA 
FRAME HINT nodes 2641, 2642, ... , 264m in respective subordinate levels 

25 and each of the MEDIA FRAME HINT nodes 2641, 2642, ... s 264n stores 
information on frames that can support the corresponding target bit rate. In 
this case, the bandwidth of the network may not be fixed to 400 kbps and may 
vary. If the bandwidth varies, then a QoS level to be selected also varies 
accordingly. Therefore, if the server 100 selects frames corresponding to the 

30 QoS level (for example, QoS level 2) appropriate to the bandwidth of the current 
network and transmits the frames, the data transmission rate is adjusted to the 
bandwidth of the current network such that adaptive streaming can be achieved. 
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FIG. 11 is a diagram showing the bit rate change of a multimedia stream 
by an adaptive streaming method according to the present invention when the 
QoS level is set to 3. The graph shown in FIG. 11 is the result of adaptive 
streaming when sequence data with a common intermediate format (CIF) size 
5 of 352 x 288, a frame rate of 30 fps and an average bit rate of 658 kbps is used. 
The graph shows the bandwidth of the network (net_bw), the bit rate before 
adjusting (org_bitrate), and the adjusted bit rate (adt_bitrate) according to the 
present invention with the lapse of time when the QoS level is divided into three 
classes. 

10 Referring to FIG. 11, the network bandwidth (net_bw) gradually 

decreases in interval A, reaches a minimum in interval B, and recovers in 
interval C. At this time, since the original bit rate (org_bitrate) of the data 
before adjusting is less than the network bandwidth (net_bw) in the interval A, 
bit rate adjusting by QoS level does not occur. Likewise, since the original bit 

15 rate (org_bitrate) of the data before adjusting is less than the network bandwidth 
(net_bw) in the interval C, bit rate adjusting by QoS level does not occur. 
However, since the original bit rate (org_bitrate) of the data before adjusting is 
greater than the network bandwidth (net_bw) in the interval B, if this state 
continues for a long time, network congestion occurs and packets may be 

20 dropped during transmission. Therefore, in the present invention the bit rate is 
adjusted to the network bandwidth (net_bw) according to the QoS level in the 
interval B such that packet drop, which can occur during transmission, can be 
prevented. 

FIG. 12 is a diagram showing the change of a peak signal to noise ratio 
25 (PSNR) value when the QoS level is set to 3 as in FIG. 11. 

PSNR is obtained by the following equations 2 and 3: 



RMSE = U-'^Ysif^J) ~ fH J)] 2 (2) 

y MN ttpo 

30 The equation 2 is originally an equation for calculating the root mean 
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square error (RMSE) of a decoded image for an original image. Here, f{ij) 
denotes the pixel value of the original image and f(i,j) denotes the pixel 
value of the decoded image. M and N denote the number of pixels of the 
original image and that of the decoded image, respectively. 
5 If the RMSE of the decoded image for the original image is obtained 

according to the equation 2, PSNR value is obtained by the following equation 
3: 



PSNR = 20log x J 255 ) (3) 
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The part where the PSNR value dramatically decreases in the graph of 
FIG. 12 indicates the part where a frame drop occurs for adjusting the bit rate. 
At this time, the average PSNR is 34.87 dB. 

FIG. 13 is a diagram showing the bit rate change of a multimedia stream 

15 by an adaptive streaming method according to the present invention when the 
QoS level is set to 5. FIG. 14 is a diagram showing the change of a PSNR 
value when the QoS value is set to 5 as in FIG. 13. The graphs of FIGS. 13 
and 14 show the results of experiments performed under the same conditions 
as the experiments of FIGS. 11 and 12. 

20 Referring to FIG. 13, the adjusted bit rate (adt_bitrate) of data when the 

QoS level is set to 5 is provided more appropriately to the change of the 
network bandwidth (net_bw) compared to that of FIG. 11 when the QoS level is 
set to 3. Therefore, in the frame drop rate and average bit rate, the adjusted 
bit rate when the QoS level is set to 5 can get more improved results than the 

25 adjusted bit rate when the QoS level is set to 3. 

Next, referring to FIG. 14, it is shown that the part where the PSNR 
value is dramatically lowered shrinks in the graph of FIG. 14 compared to the 
part in FIG. 12. This means that the part where frame drop occurs shrinks. 
When the QoS level is set to 5 as FIG. 14, the average PSNR is 35.57 dB. 

30 FIG. 15 is a diagram showing the bit rate change of a multimedia stream 
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by an adaptive streaming method according to the present invention when the 
QoS level is set to 7, and FIG. 16 is a diagram showing the change of a PSNR 
value when the QoS value is set to 7 as in FIG. 15. The graphs of FIGS. 15 
and 16 show the results of experiments performed under the same conditions 

5 as the experiments of FIGS. 11 and 12. 

Referring to FIG 15, the adjusted bit rate (adt_bitrate) of data when the 
QoS level is set to 7 is provided more appropriately to the change of the 
network bandwidth (net_bw) compared to that when the QoS level is set to 3 or 
5. Therefore, in the frame drop rate and average bit rate, the adjusted bit rate 

10 when the QoS level is set to 7 can get more improved results than the adjusted 
bit rate when the QoS level is set to 3 or 5. 

Next, referring to FIG. 16, it is shown that the part where the PSNR 
value is dramatically lowered shrinks in the graph of FIG. 16 compared to the 
parts in FIG. 12 and FIG. 14. This means that the part where frame drop 

15 occurs shrinks greatly. When the QoS level is set to 7 as FIG. 16, the average 
PSNR is 35.89 dB. 



The results obtained in FIGS. 11 through 16 can be centralized as in the 
following Table 1 : 

Table 1 



Number of QoS levels 


Frame drop rate (%) 


Average bit rate (bps) 


Average PSNR (dB) 


3 


19.3 


583.596 


34.87 


5 


12.7 


608.964 


35.57 


7 


9.6 


621.112 


35.89 



20 

As shown in the Table 1 and FIGS. 11 through 16, the present invention 
can provide services more appropriately to the change of the network 
bandwidth as the number of QoS levels increases (that is, as the QoS level is 
divided more finely). As the number of QoS levels increases, the frame drop 
25 rate decreases and the average bit rate and the average PSNR increase. 
Accordingly, as the number of QoS levels increases, more detailed control of a 
transmission rate is enabled. Therefore, the multimedia streaming apparatus 
and method according to the present invention can provide multimedia 
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streaming services adaptively according to the change of the network 
bandwidth without imposing a burden on the server. 

As described above the adaptive streaming apparatus and method 
according to the present invention can provide optimum streaming services 
5 regardless of the condition of a network and the type of a terminal that receives 
the sen/ices. If only metadata is specified together with multimedia data, the 
apparatus and method can be applied regardless on the format of contents to 
be delivered, and compared to prior art method, the burden on the server 
decreases in streaming. Also, the present invention can be applied to data 

10 streaming in a wireless communications network as well as in a wire 
communications network. 

The present invention may be embodied in a code, which can be read 
by a computer, on a computer readable recording medium. The computer 
readable recording medium includes all kinds of recording apparatuses on 

15 which computer readable data are stored. 

The computer readable recording medium includes storage media such 
as magnetic storage media (e.g., ROM's, floppy disks, hard disks, etc.), optically 
readable media (e.g., CD-ROMs, DVDs, etc.) and carrier waves (e.g., 
transmissions over the Internet). Also, the computer readable recording 

20 medium can be used via computer systems connected through a network and 
can be stored and executed in a distributed mode. 
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